® 




Europiisches Patentamt 
Eur pean Patent Office 
Office uropten des brevets 



© Publication number: 



0 046 669 

A1 



EUROPEAN PATENT APPUCATION 



© Application number: 8130382&4 
© Date of filing: 21.08.81 



©Int. CI. 3 : C 07 C 103/52 

C 12 N 15/00, G 01 N 1/20 
C 07 H 21/04, C 12 P 21/02 



© Priority: 25.0830 US 181046 


© Applicant: THE REGENTS OF THE UNIVERSITY OP 




CALIFORNIA 


© Date of publication of application : 


2200 University Avenue 


Berkeley, California 94720JUS) 


03.03.82 Bulletin 82/3 


© Inventor: Hobart, Peter 




© Designated Contracting States: 


2079 Golden Gate Drive 


AT BE CH DE FR GB IT U LU NL SE 


San Francisco California 941 15UJS) 




© Inventor: Crawford, Robert 




609 Gonzalez Drive 




San Francisco California 94132(US) 




© Inventor: Pictet, Raymond L Inst de Recherche en 




Btologie 




Moleculatre Unhrersite Tour 43 2, Place Jussfeu 




F-75221 Paris 7 Cedex 05IFR) 




© Inventor: Rutter, William J. 




80 Everson Street 




San Francisco California 94131 (US) 


.■ 


©Representative: De Minvteile-Devaux, Ian Benedict 




Peter et al, 




CARPMAELS & RANSFORD 43. Bloomsbiiry Square 




London WC1 A 2RA(GB) 


A 





© Somatostatin oi somatostatin precursors. 

Somatostatin precursors ere preprosomatostatin-1, pro- 
somatostatin-1, preprosomatostatin-2, prosomatostatin-2 
and somatostatin-2. DNA transfer vectors comprise a deox- 
ynucleotide sequence coding for a somatostatin or one of 
such precursors. 
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SOMATOSTATIN OR SOMATOSTATIN PRECURSORS 

The present invention relates to a novel somato- 
statin and novel somatostatin precursors, DNA transfer 
vectors containing a deoxynucleotide sequence coding for 
any one of these and microorganisms containing the trans- 
fer vector as well as a method for making the transfer 
vector. 

Specifically, the present invention provides 
for a somatostatin or somatostatin precursor selected 
from the group comprising preprosomatostatin-1, prosomato- 
statin-1, preprosomatostatin-2, prosomatostatin-2 and 
somatostatin-2^ 

The invention further provides for a DNA trans- 
fer vector comprising a deoxynucleotide sequence coding 
for a somatostatin or somatostatin precursor selected 
from the group comprising preprosomatostatin-1, prosomato- 
statin-1, preprosomatostatin-2, prosomatostatin-2 and 
somatostatin- 2. 

In one embodiment, the DNA transfer vector contains 
the deoxynucleotide sequence coding for somatostatin-2. 

In a further embodiment, the DNA transfer vec- 
tor containing the somatostatin-2 sequence comprises a 
plus strand having the sequence: 

5 » - GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC 
TTC ACT TCC TGT TAA -3' 

wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

In another embodiment, the DNA transfer vector 
contains a deoxynucleotide coding for preprosomato- 
statin-1. 

In a further embodiment, the DNA transfer 
vector containing the preprosomatostatin-1 sequence 
comprises a plus strand having the sequence: 
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5' - ATG AAG ATG GTC TCC TCC TCG CGC CTC CGC 
TGC CTC CTC GTG CTC CTG CTG TCC CTG ACC GCC TCC ATC AGC 
TGC TCC TTC GCC GGA CAG AGA GAC TCC AAA CTC CGC CTG CTG 

CTG CAC CGG TAC CCG CTG CAG GGC TCC AAA CAG GAC ATG ACT 
CGC TCC GCC TTG GCC GAG CTG CTC CTG TCG GAC CTC CTG CAG 
GGG GAG AAC GAG GCT CTG GAG GAG GAG AAC TTC CCT CTG GCC 
GAA GGA GGA CCC GAG GAC GCC CAC GCC GAC CTA GAG CGG GCC 
GCC AGC GGG GGG CCT CTG CTC GCC CCC CGG GAG AGA AAG GCC 
GGC TGC AAG AAC TTC TTC TGG AAA ACC TTC ACC TCC TGC TGA 
- 3' 

wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

In an additional embodiment, the DMA transfer 
vector contains the deoxynucleotide sequence coding for 
preprosoma tostatin-2 . 

In a further embodiment, the DWA transfer vec- 
tor containing the preprosoma tostatin-2 sequence com- 
prises a plus strand having the sequence: 

5' - ATG CAG TGT ATC CGT TGT CCC GCC ATC TTG 
GCT CTC CTG GCG TTG GTT CTG TGC GGC CCA AGT GTT TCC TCC 
CAG CTC GAC AGA GAG CAG AGC GAC AAC CAG GAC CTG GAC CTG 
GAG CTG CGT CAG CAC TGG CTG CTG GAG AGA GCC CGG AGC GCC 
GGA CTC CTG TCC CAG GAG TGG AGT AAA CGG GCG GTG GAG GAG 
CTG CTG GCT CAG ATG TCT CTG CCA GAG GCC ACG TTC CAG CGG 
GAG GCG GAG GAC GCG TCC ATG GCA AC A GAA GGA CGG ATG AAC 
CTA GAG CGG TCC GTG GAC TCT ACC AAC AAC CTA CCC CCT CGT 
GAG CGT AAA GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC TTC 
ACT TCC TGT TAA - 3 ' 

wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 
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In a still further embodiment the .DNA transfer 
vector contains a deoxynucleotide sequence comprising a 
DNA strand haying a deoxynucleotide sequence complemen- 
tary to the ribonucleotide sequence of mRNA coding for 
5 said somatostatin or somatostatin precursor. 

In a further embodiment the DNA transfer vec- 
tor is pBR322/pLaSl or pBR322/pLaS2. 

The DNA transfer vectors of the invention may 
be used to transform microorganisms and the invention 
10 provides for microorganisms transformed by these trans- 
fer vectors . 

The invention also provides for a process for 
preparing a DNA transfer vector containing a sequence 
coding for a somatostatin or a somatostatin precursor 
15 characterized by reacting said deoxynucleotide sequence 
with a DNA molecule prepared by cleaving a transfer vec- 
tor with a restriction enzyme. 

In one embodiment, a method for making the 
plasmid pBR322/pLaSl or pBR322/pLaS2 is characterized 
20 in that the DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme is pBR322 and the 
restriction enzyme is Pst I. 

The invention further provides for a method 
for preparing an expression transfer vector characteri- 
25 zed in that the DNA molecule prepared by cleaving a 

transfer vector with a restriction enzyme is cleaved at 
a point within an expressible control region. 

The invention also provides for a method for 
making a fusion protein comprising the, amino acid 
30 sequence of a somatostatin or a somatostatin precursor 
as its C-terminal end and a portion of a procaryotic 
protein as its N-terminal end characterized by incubat- 
ing a microorganism transformed by an expression vector. 

The invention further provides for a process 
35 for synthesizing a somatostatin or a somatostatin 
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precursor selected from the group comprising prepro- 
somatostatin-1, preprosomatostatin-2 and somatostatin- 2 
characterized by incubating a microorganism transformed 
by an expression vector comprising a deoxynucleotide 
5 sequence coding for said somatostatin or somatostatin 
precursor under conditions suitable for expression of 
said deoxynucleotide sequence and purifying said 
somatostatin or somatostatin precursor from the lysate 
or culture medium of said microorganism. 
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Somatostatin was originally isolated from the 
mammalian hypothalamus and shown to be an inhibitor of 
the secretion of growth hormone. Vale, W. r et al. r 
C.R. Acad, Sci. Paris , 275 , 2913 (1972) . The hormone is 
5 a peptide of 14 amino acids length* The amino acid 
sequence of human Somatostatin is AlaGlyCysLysAsnPhe 
PheTrpLysThrPheThrSerCys . The peptide may be linear or 
cyclic, by disulfide bond formation between the two Cys 
residues, but either form is active. More recent work 

10 has shown that the hormone is more widespread throughout 
the body and has a broader range of regulatory activities 
than was previously supposed. In particular. Somatostatin 
has been localized in the D cells of the endocrine 
pancreas and has been shown to be an inhibitor of both 

15 insulin and glucagon secretion. In addition, Somatostatin 
has been detected in discrete epithelial cells lining 
the gut where it functions to inhibit the secretion of 
several gastrointestinal hormones including secretin, 
gastrin and choleocystikinin. However, Somatostatin is 

20 not universally involved in -regulating the secretion of 
peptide hormones, since it has no effect on the 
secretion of follicle stimulating hormone, luteinizing 
hormone or thyrotropin. In addition to its regulatory 
role in the production of certain hormones, there is 

25 also evidence suggesting that Somatostatin functions as 
a neurotransmitter. 

The widespread localization and breadth of 
this hormone's regulatory activity indicate that the 
somatostatin activity is of fundamental physiological 

30 significance. However, there is no evidence that the 
observed regulatory effects are everywhere exerted by 
the same compound. Indeed, peptides up to 10 times the 
size of Somatostatin have been found which react with 
Somatostatin antibodies. See Conlon, J. et al., 

35 FEBS Letts .. 94, 327 (1978) and Noe, B. et al., 
Endocrinology~ 102, 1675 (1978) . One such larger 
peptide has been shown able to inhibit secretion of 
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growth hormone by the pituitary. Recently, a 28 residue 
peptide containing the somatostatin sequence has been 
isolated from porcine intestinal tissue and sequenced 
(Pradagrol, L., FEBS Letts ,. 109, 55 (1980)). These and 
5 other observations have prompted the suggestion that 
Somatostatin, in common with a number of other peptide 
hormones, may be synthesized as part of a large precursor 
peptide that is subsequently processed. Such processing 
could modify the known somatostatin activities, 

10 resulting in enhanced specificity or selectivity* 

Analogs of Somatostatin have been chemically 
synthesized. Such analogs have altered relative 
activities in their effects on different hormones when 
compared with Somatostatin itself. Thus, the somato- 

15 statin analogs and peptides comprising the somatostatin 
sequence are known to have somatostatin activity 
although the relative selectivity and specificity of 
these activities may be different from Somatostatin 
itself. The magnitude of such differences may not be 

20 predictable from knowledge of the structure alone. As 
a general principle, it would be expected that larger 
proteins comprising the somatostatin sequence display 
more specificity of activity. In fact, the emerging 
picture is that Somatostatin itself is but one of a 

25 family of related peptides, all of whom function to 
inhibit the secretion of peptides, especially peptide 
hormones, but which may differ in amino acid sequence, 
depending upon their -source and target tissue. We refer 
herein to this general class of peptides as somato- 

30 statins. 

Because of their known biological activities, 
the somatostatins have utility for the regulation of 
hormone output and for the treatment of disease 
conditions characterized by hormone overproduction. 
35 Somatostatin is subject to rapid degradation by 

proteases. In addition. Somatostatin itself is rapidly 
excreted following parenteral administration, making it 
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difficult to control the effective dose. Larger peptides 
are excreted more slowly. Therefore, higher molecular 
weight somatostatins provide the means for more readily 
obtaining and controlling a satisfactory effective dose. 
5 The somatostatins will be advantageous for the treatment 
of a broad range of hormone imbalance conditions. For 
example, a somatostatin selectively inhibiting glucagon 
but not insulin secretion would be beneficial for 
insulin-dependent diabetics who suffer ill effects from 
10 relative overproduction of glucagon, which exacerbates 
hyperglycemia. Also, proliferative retinopathy, which 
is a frequent pathology in diabetes, is linked to the 
overproduction of growth" hormone in such patients and 
therefore adult patients can benefit from administration 
15 of a somatostatin. Constant maximal secretion induced 
by hyperglycemia may be deleterious to B cells. Some 
individuals produce a small but insufficient quantity of 
endogeneous insulin at maximum capacity. These 
patients can benefit from administration of an exogenous 
20 somatostatin. Patients whose glucagon synthesis is 
relatively normal would benefit similarly from 
administration of a somatostatin lacking the activity of - 
inhibiting glucagon secretion. 

Somatostatin has been purified from various 
25 tissue sources and has also been synthesized by ordinary 
peptide chemistry techniques. In addition, Somatostatin 
has been synthesized by a microorganism using 
recombinant DNA techniques, Itakura, K. et al., Science , 
198, 1056 (1977). In that instance, the DNA coding for 
30 the amino acid sequence of Somatostatin, was chemically 
synthesized on the basis of the known amino acid 
sequence, and the coding relationships of the genetic 
code. The codons were selected on the basis of supposed 
compatibility with the host microorganism, E. coli. 
35 Until the present invention, we believe that the 

naturally-occurring DNA sequence coding for somatostatin 
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has never been isolated and characterized. Consequently, 
no information regarding precursor proteins was available* 

SUMMARY OF THE INVENTION 
The present invention discloses the cloning of 
5 cDNA containing naturally occurring sequences coding for 
somatostatins comprising both known and novel somato- 
statin sequences* For the purpose of this application, 
the following nomenclature is employed: 

Somatostatin 1 (SI) refers to the 14 amino acid 
10 peptide heretofore termed Somatostatin, supra, and in 
the prior art. 

Somatostatin 2 (S2) is a novel, naturally 
occurring somatostatin, 

Preprosomatostatin 1 (Propro SI) is a novel 
15 protein comprising SI as its C-terminal sequence, and 
having as its N-terminal sequence a sequence of 
hydrophobic amino acids characteristic of the signal 
peptide described for other precursors of secretory 
proteins, Devillers-Thiery, A. et al., Proc, Nat. Acad . 
20 Sci. USA , 72, 5016 (1975). 

Prosoma to statin 1 (Pro Si) results from the 
removal of the signal peptide from Prepro Si. 

Similarly, preprosomatostatin 2 (Prepro S2) 
and prosomatostatin 2 (Pro S2) are novel proteins 
25 comprising the S2 amino acid sequence at the carboxyl 
terminus and a signal peptide in the case of prepro S2 
or lacking the signal -peptide in the case of pro S2. 
Prepro SI has a molecular weight approximately the size 
of the "big somatostatin" molecule which has been 
30 detected in both mammalian and fish tissue, Conlon, J. 

* 

et al., supra , and Koe, B. et al., supra . 

Prepro Si and prepro S2 are considered to be 
naturally occurring precursor molecules from which 
active somatostatins, including pro SI, SI, pro S2 and 
35 s2 are derived by additional processing steps. 
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The present invention provides for the synthesis 
of the foregoing novel somatostatins in a microorganism 
transformed by cDNA -coding appropriately for the desired 
protein, said cDNA being inserted in a transfer vector 
5 suitable for obtaining expression of the cloned insert, 
either as a protein product or as a fusion protein 
coupled to a segment of bacterial protein. The present 
invention therefore includes the novel somatostatins 
per se, new methods of making both the novel somatostatins 
10 and SI, transfer vectors comprising inserted coding 
sequences for the foregoing somatostatins, new micro- 
organisms transformed by the above-mentioned transfer 
vectors, and new fusion proteins comprising the amino 
acid sequences of the foregoing somatostatins. 

15 DETAILED DESCRIPTION OF THE INVENTION 

Since a naturally occuring coding sequence 
comprising the coding region for SI had never previously 
been isolated, it was necessary to use as starting 
material a tissue known to have a sufficiently high 
20 proportion of somatostatin-producing cells to permit the 
practical isolation of mRNA sufficiently enriched in 

4 

somatostatin coding sequences. In the anglerfish 
(Lophius americanus) . the endocrine pancreas, termed 
Brockmann body, is easily separable from exocrine 

25 tissue and is highly enriched in D cell types (somato- 
statin-producing) relative to B cells (insulin- 
producing) . Furthermore, the amino acid sequence of SI 
isolated from anglerfish Brockmann body cells is 
identical to mammalian SI, Noe, B. et al.. Endocrinology , 

30 105, 1410 (1979) . 

A variety of cDNA inserts were synthesized 
from Brockmann body mRNA and cloned. The clones were 
then tested for their ability to cross-hybridize with 
each other, m this manner, it was determined that the 
35 majority of cDNA clones fell into one of three groups. 
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Subsequent analysis showed that these groups included 
cDNA coding for fish preproinsulin, prepro SI and, 
unexpectedly, the sequence coding for prepro S2. The 
techniques of the present invention have made extensive 
5 use of enzyme-catalyzed reactions. - 

RNA is isolated from Brockmann body cells 
using conventional techniques. Polyadenylated RNA is 
isolated by affinity chromatography. The polyadenylated 
RNA is used as the template for preparing a double 

10 stranded cDNA copy using conventional techniques. The 
first cDNA strand is synthesized using reverse 
transcriptase, an olig-dT primer and the polyadenylated 
RNA. After synthesis of the first strand of cDNA, the 
RNA is removed by alkali digestion and the single- 

15 stranded cDNA is used to self -prime the synthesis of the 
second strand by reverse transcriptase. The single- 
stranded hairpin loop is removed by digestion with SI 
nuclease. 

The cDNA is now ready for insertion into an 
20 appropriate transfer vector. Suitable transfer vectors 
include those which are capable of transforming 
bacteria, such as Escherichia coli and Bacillus subtilis , 
yeast ( Saccharomyces cerevisiae) , the inl, csp and esp 
mutant strains of Neurospora crassa and animal cells in 
25 tissue culture. Examples of transfer vectors suitable 
for use in this invention which are capable of 
transforming E. coli include: the pi asm ids pSClOl, 
pMB9, pBR313, pBR315,' pBR316 and pBR322 and the vectors 
derived from bacteriophage, i.e., Charon 3A, Charon 4 A, 
30 Charon 16A and J\.gtWES-^B. References which describe 
the preparation and characterizations of these transfer 
vectors are listed in Table 1. 
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Transfer Vector 
pSClOl 



pMB9 



pBR313 
PBR315 
pBK316 
pBR322 



Charon 3A 

Charon 4A 
Charon 16A 

7\>gtWES • ?c B 



Table 1 

Reference (s) 

Cohen, et al., Proc. Natl .Acad 
Sci.USA , 70 , 1293 and 3240 
(19-7) . *— 
Boyer, et al.. Recombinant 
Molecules , Beers, et al., 
Eds., Raven Press r NY, at 
p. 13 (1977). 

Cohen, et al., Recombinant 
Molecules , at p. 91. 

Rodriguez, et aL, Molecular 
Mechanisms in Control of 
Gene Expression (ICN-DCLA 
Symposium on Mol. & Cell . 
Biol. Vol. 5), Nieslich, 
et al., Eds., Academic 
Press, NY, at p. 471 (1976) . 

Bolivar, et al., Gene , 2, 75 
(1977). 

Bolivar et al,, supra. 

Rodriguez et al., supra . 

Rodriguez et al., supra . 

Bolivar et al.. Gene , 2, 95 

(1977) . 

Bolivar et al.. Gene , 121 

(1978) . 
Sutcliffe, Cold Spring 

Harbor Symp. on Quant . 
Biol ,, 43, 77 (1978). 
Blattner et al.. Science , 
196, 161 (1977) 



Blattner et al.", supra . 
Blattner et al., supra . 
Tremeier et al.. Nature , 
263, 526 (1976). 
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'XgtWES-^B Leaer et al.. Science , 19£, 

175 (1977) • 

Other suitable vectors for transforming E. coli are 
described in Sinsheimer, R.L. , Ann . Rev . Biochem . , 46 , 415 
5 (1977) . Suitable transfer vectors for transforming 
B. subtilis have been described by Iordanescu , S. , J. 
Bacteriol ., 124 , 597 (1964) and Ehrlich, S.D., ?roc . 
Natl . Acad . ScITuSA , 74, 1680 (1977) . One such vector has 
been identified as pTasmid pCl94. Hybrid plasmids 

10 containing yeast DNA — either plasmid and/or 

chromosomal DNA — and bacterial DNA — generally E. 
coli plasmid DNA — . are used to transform yeast. Such 
plasmids have been described by Beggs, J.D., Nature 
275 , 104 (1978); Asiav, C.L. and Carbon, J., Proc.Natl . 

15 Acad. Sci. USA 76, 3829 (1979); and Kingsman, A.K., et al*. 
Gene 7, 141 (1979) . Transfer vectors which can be 
utilized to transform animal cells in tissue culture 
include Adeno defective virus, SV-40 defective virus 
and polyoma virus. 

20 * The cDNA is inserted into a transfer vector 

using conventional techniques. The cDNA is usually 
inserted into any restriction site which is unique in 
the transfer vector. Charon 3A and 4A and 71 gt WES-PuB 
do not contain unique sites and, as a result, restriction 

25 sites of limited number are used for these vectors. The 
unique restriction sites in various transfer vectors and 
the useful restriction sites in vectors Charon 3A and 
4A and £gt WES.^B are shown in Table 2. 

Table 2 

Unique Restriction Sites 
EcoRi, Hind III, Sal I, 
Bam HI, Hpa I, Sma I 
EcoRi, Hind III, Sal I, 
Bam HI 



30 Transfer Vector 
pSClOl 

pMB9 
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pBR313 EcoRI, Hind III, Sal I, 

Barn HI, Hga I, Sma I, Xma I 

pBR315 EcoRI. Hind III, Sal i f 

Bam HI, Pst I 

5 PBR316 Hind III, Sal i. Bam HI, 

Pst I 

pBR322 EcoRI, Hind III, sal I, 

Bam HI, Pst I 
I 30194 Hind III 

10 Charon 16A EcoRI, Sst I 



15 



20 



25 



30 



Charon 3A 



EcoRI 



Charon 4A_ EcoRI 
X gt WES-^B EcoRI 

Generally, the cDNA is inserted into a restriction site 
within a phenotypic trait for ease in selection. For 
example, Hind III, Sal I and Bam HI in pSClOl, p mb9, 
PBR313, pBR315, pBR316, and pBR322 are within the gene 
for tetracycline resistance or its control region, 
insertion at this site results in loss of tetracycline 
resistance. The cDNA can be inserted into the transfer 
vector by the use of restriction site linkers or 
deoxynucleotide tailing. In the former, a synthetic 
nucleotide containing a recognition site for a 
particular restriction endonuclease, such as those 
discussed above, can be blunt-end ligated to the cDNA. 
The cDNA and transfer vector are separately incubated 
with the restriction endonuclease and then annealed to 
form the transfer vector containing the cDNA. in the 
latter, the cDNA can be tailed using a deoxynucleotide 
and terminal transferase as described by Roychoudhury, 
R., et al., Nucl. Acids Res .. 3, 863 (1976). The 
transfer vector, after digestion with a restriction 
endonuclease, can be tailed using the complementary 
deoxynucleotide in the same procedure. The tailed 
35 transfer vector and tailed cDNA are then annealed to 
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form the transfer vector containing the cDNA. For 
example, the cDNA can be dC-tailed and the transfer 
vector can be opened at the Pst I site and dG-tailed* 

The transfer vector containing the cDNA is 
5 then used to transform a suitable host. Hosts which are 
suitable for the various transfer vectors include B. coli , 
B. subtil is , yeast, N. crassa, and animal cells in 
tissue culture. The transfer vectors which are capable 
of transforming each of these hosts have been described 

10 above. Three mutant strains of E. coli are 

conventionally utilized. These are ^1776, ERI and 
HB101. E. coli %1776 has been described in Curtis, R. , 
Ann. Rev. Microbiol . 30 , 507 (1976) and U.S. Patent No* 
4,190,495. E. coli" ~RRI has been described in 

15 Dugaiczyk, A., et al., Molecular Mechanisms in Control of 
Gene Expression at p. 543. E. coli HB101 has been 
described in Kedes, D.H. and Thomas, C.A., Proc . Natl . 
Acad. Sci. USA 73 , 1537 (1976) . The suitable hosts are 
transformed by~~conventional techniques. For example, 

20 E. coli ^1776 is transformed by the transfer vector 

containing the cDNA as described by Cooke, N.E., et al., 
J.Biol.Chem . , 255 , 6502 11980). Colonies are selected 
and/or screened by conventional techniques. For 
example, colonies may be selected based on the loss of 

25 a phenotypic trait, such as tetracycline resistance- 
Colonies may be screened by (1) removing the cDNA by an 
appropriate restriction endonuclease and analyzing it 
by electrophoresis and hybridization (Southern, E.M., 
J.Mol.Biol . 98, 503 (1975)), or (2) replicating as 

30 described by^runstein, M. and Hogness, D.S., Proc. Natl . 
Acad. Sci. USA 72, 3961 (1975) and hybridizing with an 
appropriate problem, or (3) examining colonies directly 
for expression by RIA or other techniques. 

The cloned DNAs designated pLaSl and pLaS2 

35 are expressed in bacteria to yield either fusion 

proteins comprising the peptides coded by the inserted 
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sequences, or the peptides themselves. Several possible 
techniques are available as options, and may include (a) 
modification of the coding sequences to provide an 
exact desired translational starting point; (b) selection 
or construction of an optimal expression vector; and (c) 
post-translational processing, either by exploiting in 
vivo processing activity of the host or by in vitro 
chemical means . 

Where a fusion protein is expressed, 
modification of the cloned nucleotide sequence will 
generally be unnecessary as long as the resulting 
sequence permits translation of the insert in the correct 
reading frame and no stop codons intervene before the 
initial codon of the inserted sequence. Both SI and S2 
may be expressed conveniently as fusion proteins 
followed by post-translational in vitro processing, since 
both are joined to the rest of the Pro SI or Pro S2 
sequences by the dipeptide arg-lys. Such a sequence is 
especially susceptible to tryptic hydrolysis. Both SI 
and S2 are readily released by partial tryptic digestion 
of their respective precursors. 

Prepro SI and Prepro S2 are expressed as 
fusion proteins by insertion into appropriate sites 
within expressed operons including, for example, the 
25 Pst I site in the ^-lactamase gene of pBR322 (Villa- 
Komaroff , et al., Proc. Nat. Acad. Sci. USA , 75, 3727 
(1978) ) , the EcoRI site of pBR322 carrying the~lac 
control region and coding sequence for y#-galactosidase 
(Itakura, K., et al.. Science , 198, 1056 (1979)), or 
the Hind III site of the trpD gene of plasmid ptrpEDSO 
(Martial, J., et al.. Science , 205 , 602 (1979) )\ 
Modifications of sequence lengtEUy one or two 
nucleotides in order to achieve correct reading frame 
phase are well known in the art. Insertions at the 
35 Pst I site of pBR322, with the aid of the tailing 
procedure, occur in correct phase and reading frame 
with a, probability of 1/6. 
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Pro SI and Pro S2 are prepared from fusion 
proteins susceptible of specific cleavage in vitro . The 
cloned nucleotide sequences are modified to code for 
amino acid sequences providing specificity for a 
5 proteolytic enzyme. A useful sequence is AspAspAspAspLys, 
cleaved preferentially by the enzyme enterokinase, as 
described in copending application Ser. No. 125,878, 
filed February 29, 1980, incorporated herein by 
reference. As described therein, a linking nucleotide 

10 sequence coding for the foregoing amino acid sequence is 
inserted adjacent the nucleotide sequence coding for 
the desired amino terminus of Pro SI or Pro S2. 

Such insertion requires modification of pLaSl 
or pLaS2, by removal of nucleotides on the 3 1 end of the 

15 Pro Si or Pro S2 coding sequence. This will be 

accomplished either by controlled digestion of the 3 * 
end of the insert using 3J endonuclease or T4 DNA 
polymerase, or by the combination of restriction 
endonuclease cleavage at a point to the 5 f side of the 

20 desired starting point and chemical synthesis to restore 
that portion of the desired sequence thus removed. 
Controlled digestion is preferably carried out using 
T4 polymerase, which, in the absence of added 
deoxynucleotide triphosphates, catalyzes 3 1 to 5 1 

25 exonucleolytic digestion of double-stranded DNA 

(Englund, P.T., J. Biol. Chem ., 246 , 3269 (1971)). The 
extent of digestion is controlledTy selection of proper 
temperature, reaction time and amount of enzyme, 
according to principles well known in the art. 

30 Experimentation will be necessary in each instance, 

since optimal reaction conditions must be determined for 
each lot of enzyme and for each DNA to be modified. By 
these means, the extent of digestion can be controlled. 
Termination of digestion at a predetermined stopping 

35 point is achieved by including a single deoxynucleotide 
triphosphate in the reaction mixture, corresponding to 
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the desired stopping point. For example, in the 
presence of dATP, the DNA is digested 3'-5« until the 
polymerase reaches a dA residue, at which point further 
net digestion ceases. Several cycles of digestion, each 
with its predetermined stopping point, can be carried out 
in sequence, to construct DNA molecules having a 
predetermined end point. Exonucleolytic digestion with 
T4 polymerase affects only the strands having 3 • termini . 
The complementary strands remain as unpaired single- 
stranded tails, which must also be removed, si nuclease 
is -the preferred enzyme for the purpose. The product of 
combined treatment with T4 polymerase and SI nuclease is 
blunt-ended, double-stranded DNA. Chemical synthesis of 
deoxynucleotide sequences used to replace segments 
cleaved by restriction endonuclease digestion is carried 
out, for example, as described by Itakura, K. , et al., 
J. Biol. Chem., 250 , 4592 (1975) and Itakura, K., et al., 
J. Am. Chem. Soc , 9J7, 7327 (1975). 

Prepro SlTnd Prepro S2 are synthesized in 
similar fashion, expressed initially as fusion proteins. 

By the use of appropriate expression transfer 
vectors, the somatostatins of the present invention are 
also expressed directly, i.e., not fused to any 
procaryotic protein. In general, it is necessary to 
25 modify the cloned inserts, pLaSl and P LaS2, to remove 
the 5» untranslated regicr. preceding the desired 
starting point. VThere Prepro Si and Prepro S2 are the 
primary expression products, the other somatostatins are 
obtai ed from them. 

30 The underlying principle of direct expression 

is tl t the inserted DNA segment entirely replaces the 
codin segment normally transcribed and translated by 
the 1 cterial control region. The essential component 
of tl control region to be preserved is termed the 

35 exprc sion unit. The expression unit includes a 

promc ,-r and a ribosomal binding site capable of acting 
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in the host organism. As a practical matter, it is not 
necessary to remove precisely the nucleotides coding for 
the host portion of the fusion protein. The relationship 
between the ribosomal binding site and the start codon 
5 (AUG) is such that the start codon may be located 
anywhere within 3 to 11 nucleotides of the ribosomal 
binding site. Shine et al., Proc. Nat. Acad. Sci. USA , 
71, 1342 (1974); Steitz, J., et al., Proc. Nat. Acad , 
Sci USA , 72, 4734 (1975) . In this 3-11 nucleotide 

10 region, the first AUG to be encountered sets the reading 
frame for translation. In the case of ptrpE30, derived 
from ptrpEDSO described supra and containing the operator, 
promoter, attenuator and ribosome binding sequences of 
the tryptophan operon together with the nucleotide 

15 sequence coding for seven amino acids of the trpE 

protein followed by a Hind III site, the removal of a 
minimum of 23-29 nucleotides from the Hind III site 
provides a site for insertion into an expression unit 
under tryptophan operon control • 

20 For the direct expression of Prepro SI, for 

example, the original cDNA insert is modified as 
described above to remove the 5' untranslated region. 
A vector for direct expression can be constructed by 
modification of p trpE 30 by removing 23-29 nucleotides 

25 using T4 DNA polymerase and SI nuclease as described 
above. A linker nucleotide sequence containing the 
restriction sequence for Bam HI endonuclease is blunt- 
end ligated to both the modified cDNA insert and the 
modified ptrpE30 by the procedure of Valenzuela, et al., 

30 supra . This is done to facilitate insertion which is 
performed essentially as described by Ullrich, A., et 
al.. Science , 196 , 1313 (1977). Suitable hosts, such 
as E. coli HB10I7 RRI# or ^1776 or other bacteria 
are transformed by the recombinant vectors bearing the 

35 inserted Prepro SI coding region. Transformants are 
selected for resistance to ampicillin and then grown 
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under conditions suitable for expression of Prepro 
SI. 

Prepro SI and Prepro S2 are converted to Pro 
SI and Pro S2, respectively, by removal of the N-terroinal 
sequence of hydrophobic amino acids that comprise the 
signal peptide. In vitro removal of the signal peptide 
is carried out by treating the protein extracted from 
transformed, induced cells, with a preparation of "rough" 
microsomes, as described by Jackson, R.C., Blobel, G. 
(1977), Proc. Nat. Acad. Sci. VSA . 7£, 5598. In vivo 
removal of the signal peptide may occur during direct 
bacterial expression of the Prepro SI or Prepro S2 
coding sequence. Bacterial and mammalian signal peptides 
share sequence similarities. Proteins having mammalian 
signal peptides may be processed by bacterial cells, 
resulting in excretion of the propeptide (in this 
instance Pro SI) into the periplasmic space or into the 
medium. This occurrence of such bacterial processing is 
detectable by observing the occurrence of a protein in 
the medium or periplasmic space of induced cells, not 
found in uninduced cells, as measured by incorporation 
of a radioactive amino acid or by radioimmunoassay. 

Si and S2 themselves may be prepared by 
tryptic digestion of either Prepro SI or Pro SI and 
Prepro S2 or Pro S2, respectively, as described supra . 
Either whole cell extracts or partially purified protein 
preparations may be treated in this fashion to yield 
SI or S2. 

Prepro SI, Pro SI, SI, Prepro S2, Pro S2 and 
S2, synthesized as described, are purified by techniques 
well known in the art, including, for example, gel 
filtration, ion exchange chromatography, affinity 
chromatography and differential solubility techniques. 
The amino acid sequences are confirmed by conventional 
methods of amino acid sequence analysis. 
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Purified enzymes for use in the practice of 
the present invention are presently available from 
commercial sources. Restriction endonucleases, their 
nomenclature and site specificity have been described in 
5 detail by Roberts, R., Crit. Rev. Biochem ., 4_ r 123 (1976) 
The restriction enzymes used in this work were obtained 
from New England BioLabs, Beverly , Massachusetts, and 
used in amounts and under reaction conditions specified 
by the manufacturer for each enzyme. Reverse 

10 transcriptase was provided by Dr. J. Beard, Life 

Sciences, Inc., St. Petersburg, Florida. DNA polymerase 
I and polynucleotide kinase were obtained from New 
England BioLabs, Beverly, Massachusetts. Micrococcal 
SI nuclease was obtained from Miles Laboratories, 

15 Elkhart, Indiana. Bacterial alkaline phosphatase was 
obtained from Worthington Biochemical Corporation, 
Freehold, New Jersey. 

Example 1 

Synthesis of cDNA . RNA was extracted from 

20 anglerfish Brockmann bodies obtained from Biological 
Associates, Portland, Maine, by the guanidine 
thiocyanate method, Chirgwin et al.. Biochemistry , 24 , 
5194 (1979) . One gram of Brockmann body tissue yields 
approximately 6.5 mg of total RNA. Polyadenylated RNA 

25 was purified using oligo-dT-cellulose affinity 

chromatography, Aviv, H. et al., Proc. Nat. Acad. Sci . 
USA , 69 , 1409 (1972). 1.3 Mq of single-stranded cDNA 
was made from 10 Mq polyadenylated RNA using reverse 
transcriptase (Lot No. 1577) in the following reaction 

30 mix of 200 JUT* volume: 40mM tris-HCl, pH 8.3, lOmM 

MgCl 2 , 56 mM A -mere ap toe thanol, 0.5 mM dATP, dGTP, dTTP, 
0.15 mM 3 H-dCTP (sp. Act. 5.2 mCi/ /*g/ml polyadenylated 
RNA, 15 units reverse transcriptase A4q RNA. Incubation 
was at 40°C for 30 minutes and then at 68°C for two 

35 minutes. The reaction was stopped by the addition of 
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NaEDTA (pH 8.0) to 20mM and the reaction mixture v?as 
then chromatographed over G-75 Sephadex (trademark 
Pharmacia, Inc., Uppsala, Sweden) . The DNA-containing 
fraction was ethanol precipitated. RNA was removed by 
5 alkali treatment (0.2M NaOH, 23 min. at 68°C) and the 
second strand cDNA was carried out in the following 
reaction mix using DNA polymerase I: 100 mM Na KEPES 
pH 6.9, 75mM KC1, lOmM MgCl 2 , 10mM dithiothreitol (DTT), 
lOOmM each of the four deoxynucleotide triphosphates, 
10 0.8 Mg/wl single-stranded cDNA (boiled three minutes 
just prior to addition) , and 75 units E. coli DNA 
polymerase 1/ Hg DNA. Incubation was at 14 °C for two 
hours. The reaction volume was made 20mM NaEDTA, 

extracted once with phenolchloroformisoamyl alcohol 
15 (50:48:2 by volume) and passed over a G-75 Sephadex 

column. In preparation for dC tailing, the hairpin loops 
of the double-stranded DNA (ds-cDNA) were clipped and 
made blunt-ended using SI micrococcal nuclease. SI 
digestion was carried out at 37°C for 30 minutes in 
100 jaI reaction volume, containing 0.3M NaCl, 0.03M 
NaOAc , 3mM ZnCl 2 , pH 4.5, 175 SI units/ jUg ds-cDNA. 
After organic extraction and G-75 Sephadex chromatography 
to remove small DNA fragments, 300 ng of trimmed ds-cDNA 
was tailed using calf thymus terminal transferase in a 
25 30-minute reaction at 37 *C containing 140mM potassium 
cacodylate, 30mM tris base (pH 7.8) , ImM CoCl 2 , O.lmM 
DTT, 25 M dCTP ( 0{ 32 P-dCTP, 8.36 M Ci/nmole) , and 150 
units terminal transferase/ JULg ds-cDNA. The tailing 
reaction is described, generally, by Roychoudhury, R w 
et a3 "' Nucl. Acids Res ., 3, 101 (1976). * Preliminary 
experiments indicated approximately 30 bases are added 
to the 3 1 end under these conditions. Plasmid pBR322 
was cleaved by Pst I endonuclease and provided with dG 
tails by the previously described tailing reaction, 
35 except that dGTP was used instead of dCTP, and no 

radioactive label was employed. Equimolar amounts of 
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dc tailed cDNA and dG tailed pBR322 were annealed at a 
concentration of l'/tg/ml, using sequential two-hour 
incubations at 42°C, 30°C and 14 °C (36) . The hybrid 
plasmid DNA was ethanol precipitated and then used to 
5 transform E. coli /^1776 and selected for tetracycline 
resistance* To screen with various labeled probes, 
recombinant colonies were grown on Whatman 541 cellulose 
paper, lysed in situ and prepared for hybridization 
according to the procedure of Grunstein et al. and 

10 Proc. Nat. Acad. Sci. USA , 72, 3961 (1975). The filters 
were hybridized to a 32 P labeled cDNA probe synthesized 
from anglerfish Brockmann body polyA+RKA, and plasmid 
DNA was isolated from those colonies giving a strong 
autoradiographic signal using a procedure described by 

15 Staudenbauer, W.L., ttol. Gen. Genet ., 145 , 273 (1976). 
Several plasmid s were selected on the basis of their 
size as measured by gel electrophoresis and their cDNA 
inserts were purified on preparative acrylamide gel 
electrophoresis. Purified insert cDNA from these 

20 plasmids was labeled by nick translation and hybridized 
with other purified cDNAs in pair-wise combinations. 
By this means the inserts could be divided into a 
number of non-cross-hybridizing groups and their 
relative frequency estimated. The most frequently 

25 occurring insert among the colonies selected as 

described was shown by nucleotide sequence analysis to 
code for fish preproinsulln. Two additional groups were 
analyzed and specific inserts, designated pLaSl and 
pLaS2, were selected for analysis. 

30 The two selected inserts were analyzed to 

estimate the size of the corresponding mRNA specific to 
each, by the method of Alwine et al., Proc. Nat. Acad . 
Sci. USA , 74, 5350 (1977). The method entailed 
hybridization to anglerfish polyadenylated RNA which 

35 had been previously chromatographed on a 2% agarose 

methyl mercury gel (Bailey, J., et al.. Anal. Biochenu , 
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2£, 75 (1976)), and transferred to diazotized cellulose 
paper (Alwine, Jr., et al., supra). Hybridization was 
for 48 hours at 42°C. Hybridized fibers were 
autoradiographed after washing three tiroes (100 ml 
volume, 10 minutes each) in 0.3M NaCl, 0.03M NaCitrate, 
0.1% (w/v) SDS at 20«c and then in the same manner using 
5mM NaCl, 1.5mM NaCitrate 0.1% SDS at 50°C. The results 
are shown in Figure 1. Lane 1: Hind III digested 

and Hae III digested ^X174 DNA molecular weight 
standards; Lane 2: Ethidium bromide staining of 
anglerfish Brockmann body (1 Ag polyadenylated RNA) ; 
Lane 3: Hybridization of transferred anglerfish 
polyA+ to pLaSl 32 P insert DNA; Lane 4 : Hybridization 
of transferred anglerfish polyA+ RNA to P LaS2 32 p insert 
15 DNA. 

From the data of Figure 1, it can be seen that 
both pLaSl and P LaS2 hybridize specifically to mRNAs of 
approximately 700 bases, corresponding to the size of one 
major class of polyadenylated mRNA found in the 
Brockmann body. By a similar analysis, the other major 
polyadenylated mRNA of 840 bases was shown to correspond 
to inserts coding for preproinsulin. 
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Example 2 

The deoxynucleotide sequences of pLaSl and 
pLaS2 were analyzed by the method of Maxam, A. et al., 
Proc. Nat . Acad. Sci. USA . 74, 560 (1977) . The 
ribonucleotide sequences are - shown in Figure 3, 
together with the corresponding predicted amino acid 
sequences coded by the "sense" strands of the DNA, i.e., 
the strands corresponding in sequence to the respective 
mRNAs . The deoxynucleotide sequence of the sense strand 
is identical except that T replaces U. The correct 
reading frame is recognized by a lack of termination 
codons over a substantial portion of the insert and, in 
the case of pLaSl, by the fact that it correctly codes 
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for SI. The amino acid positions are numbered beginning 
with the amino-terminal amino acid of SI and proceeding , 
in the positive direction, to the carboxy terminal end of 
SI, and in the negative direction to the first AUG codon 
5 (position -107) , presumed to be the point of translation 
initiation. 

Beginning with the first AUG codon at the 5» 
end of the sense strand, translation of pLaSl predicts 
the sequence of a 121 amino acid peptide which contains 
10 the SI amino acid sequence at its carboxy terminus. The 
121 amino acid peptide is a heretofore undescribed 
somatostatin, termed prepro Si herein, having a molecular 

weight of 13,328. 

Our results suggest that, in common with many 

15 other hormones, the synthesis of Si involved post- 
translational processing. The translation of 
somatostatin mRNA yields a precursor (prepro SI) 
containing a signal peptide which may be released during 
the transit into the endoplasmic reticulum, and the 

20 resultant precursor (pro SI) is subsequently cleaved to 
yield SI itself. By analogy with other signal peptides, 
and in particular with that of angler fish insulin with 
which it shares sequence homology, the prepeptide 
portion of prepro SI is probably about 20-25 bases long. 

25 After removal of the signal peptide, the resultant pro 
SI molecule would have approximately 96-101 amino acid 
residues and a molecular weight of approximately 9,000. 
Pro SI is more than five times the size of somatostatin. 

The pLaS2 deoxynucleotide sequence was analyzed 

30 in a manner similar to that described for pLaSl. 

Translation of pLaS2 predicts the sequence of a 125 amino 
acid peptide which surprisingly contains a 14 amino acid 
sequence at its carboxy terminus which differs from Si by 
only two amino acids, and is termed Somatostatin 2 (S2) 

35 herein. Three novel somatostatins are therefore 

contained within the full sequence: S2, Pro S2, and 
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Prepro S2. The predicted molecular weights of Prepro S2 
and Pro S2 ar 14,100 and 11,600, respectively. Figure 3 
shows the corresponding ribonucleotide and amino acid 
sequences of pLaS2. The deoxynucleotide sequence is 
5 identical except that T replaces U. 

Example 3 

(A) Example 1 is repeated using plasmid 
pBR315 in place of plasmid pBR322. All conditions 
including selection of recombinant clones are as 
described. The inserts are removed and analyzed as 
described in Example 2 , yielding DNA 1 s of about 7 00 base 
pairs having the sequences shown in Figure 3. 

(B) Example 1 is repeated using plasmid 
pBR316 in place of plasmid pBR322. All conditions are 

15 identical to those described in Example 1. Removal and 
analysis of the inserts as described in Example 2 yields 
DNA 1 s having the sequences shown in Figure 3. 
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Example 4 

For this example, the cDNA is prepared as 
described in Example 1. All restriction site linkers 
utilized in this example and those that follow are 
prepared as described by Scheller, RvH. et al.,- Science 
196, 177 (1977). The restriction endonucleases used in 
this Example and those that follow are commercially 
25 available from New England Biolabs, Beverly, 

Massachusetts, and optical conditions according to 
supplier » s recommendations are utilized . Transformation 
of E. coli ^1776 is performed as described in 
Example 1. 

30 (A) Hind III linkers having the sequence 

5 B - CCAAGCTTGG - 3 9 are ligated to the cDNA prepared in 
Example 1 by blunt-end ligation using T4 DNA ligase as 
described by Valenzuela, P., et al.. Nature 280 , 815 
{1979). After ligation, the product is digested with 
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Hind III or Hsu I following the procedure described by 
Ullrich, A* et al., Science 196 , 1313 (1977). Plasmid 
pBR322 is cleaved at the Hind III restriction site with 
either Hind III or Hsu I and treated with alkaline 
5 phosphatase as described by Ullrich, A., et al., supra . 
After ethanol precipitation, the phosphatase-treated, 
cleaved pBR322 is ligated to the cDNA containing Hind III 
cohesive termini as described by Ulirich, A., et al., 
supra . " E. coli /^1776 is transformed with the ligation 

10 mixture and the transformed colonies are selected for 
tetracycline sensitivity. The presence of cloned DNA is 
determined as described in Example 1. Colonies 
containing inserts are selected as described in Example 1. 
The inserts are removed by digestion with Hind III or 

15 Hsu I and analyzed as described in Example 2. The 
inserts contain about 700 base pairs and have the 
sequences shown in Figure 3. 

(B) Example 4 (A) is repeated in which several 
vectors are utilized in place of pBR322. In this 

20 Example, the plasmids pSClOl, pMB9, pBR313, p3R315, and 
pBR316 are utilized in place of pBR322." All conditions 
including selection of recombinant clones are as 
described in (A). For all plasmids, identical results 
are obtained, i.e., recombinant plasmids containing 

25 inserts having the sequences shown in Figure 3 are 
obtained in each instance. 

(C) Example 4 {A} is repeated in which several 
other restriction site linkers and restriction 
endonucleases are utilized. In this Example, the 

30 linkers for Sal I and Bam HI are used in place of the 

linkers for Hind III. The sequences of these linkers are 
5' - GGTCGACC - 3" and 5 1 - CCGGATCCGG - 3». When SaT I 
linkers are used, the linker- treated cDNA and pBR322 are 
cleaved with restriction endonuclease Sal I and when 

35 Bam HI linkers are utilized, the restriction endonuclease 
is Bam HI. All conditions including selection of 
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recombinant clones are as described in 4 (A) . For both 
linkers and restriction endonucleases, identical results 
are obtained* 

(D) Example 4 (C) is repeated in which plasmids 
5 pSClOl, pMB9, pBR313, pBR315 and pBR316 are used in 
place of pBR322. All conditions are the same as 
described in Example 4(C) and identical results are 
obtained, i.e., in each instance inserts having the 
sequences shown in Figure 3 are obtained. 

10 (E) Example 4 (A) is repeated in which an Eco 

RI restriction linker having the sequence 5* - CCGAATTCGG 
- 3 f is used in place of the Hind III linker and Eco RI 
is used in place of Hind III ( Hsu I). All conditions 
are identical to those described except that transformed 

15 colonies are not selected by tetracycline sensitivity. 
The presence of cloned DNA is performed as described in 
Example 1. Recombinant clones containing inserts having 
the sequences shown in Figure 3 are obtained. 

(F) Example 4(E) is repeated in which plasmids. 
20 pSClOl, pMB9, pBR313 and pBR315 are used in place of 

pBR322. The same conditions as described in Example 

4 (E) are utilized and identical results are obtained for 

each plasmid. 

(G) Example 4 (a) is repeated in which a Pst I 
25 restriction linker having the sequence 5 1 - GCTGCAGC - 

3 1 and Pst I are used in place of the Hind III linker 
and Hind III ( Hsu I) , respectively. Identical conditions 
are employed except that selection of recombinant clones 
is accomplished as described in Example 1. Recombinant 
30 clones having the Figure 3 sequence are obtained. 

(H) Example 4 (G) is repeated in which plasmids 
pBR315 and pBR316 are used in place of pBR322. The same 
conditions as described in Example 4 (G) are utilized and 
identical results are obtained. 

3 5 (I) Example 1, 3(A), 3(B) and 4 (A) - (H) are 

repeated using E. coli RRI or E. coli HB101 in place of 
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E. coli ^1776. All conditions are as described and 
identical results are obtained. 

Example 5 

For this example, the cDNA is prepared as 
5 described in Example 1. Eco RI restriction site linkers 
are blunt-end ligated to the cDNA as described by 
Valenzuela, P., et al., supra , and cleaved with Eco RI 
as described by Ullrich, A., et al., supra. 

(A) Charon 16A DNA prepared as described by 
10 Blattner et al., supra , is utilized as the transfer 

vector* The cohesive ends are annealed by incubation for 
60 minutes at 42°C in 0.1M Tris-HCl, pH 8.0 and 10 mm 
MgCl 2 . The vector is cleaved at the Eco RI restriction 
site with Eco RI endonuclease and then treated with 
15 alkaline phosphatase as described by Ullrich, A., et al., 
supra . After ethanol precipitation, the phosphatase- 
treated vector DNA is added to cDNA containing Eco RI 
cohesive termini at a molar ratio of 2 moles vector to 
1 mole cDNA. The mixture is ligated with T4 DNA ligase 
20 as described by Ullrich, A., et al., supra . The ligation 
mixture is added' directly to a suspension of E. coli 
/^1776 cells prepared for transformation as described 
in Example 1 and the transformation is performed as 
described in Example 1. Recombinant phages are 
25 recovered and plated on Lac" bacteria on plates 

containing 5-chloro-4-bromo-3-indolyl- j& -D-glactoside 
(X6), (80 //g/ml) recombinant phages containing the 
cDNA inserted into the Eco RI site of Charon 16A are 
selected for the production of colorless plaques. The 
30 desired recombinants are isolated as described in 
Example 1 and digested with an excess of Eco RI 
endonuclease and the digest analyzed as described in 
Example 2. DNA's of about 700 base pairs in length and 
having the sequences shown in Figure 3 are recovered. 
35 Alternatively, the ligation mixture is utilized to form 
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recombinant phages by in vitro packaging as described by 
Sternberg, N. et al., Gene 1, 255 (1977). Recombinant 
phages can also be screened~"by utilizing the in situ 
plague hybridization technique of Benton, W.D. and Davis, 
5 R.W., Science 196 , 180 (1977). 

(B) Charon 3 A DNA or Charon 4A DNA, prepared 
as described by Blattner, et al., supra , is used as the 
transfer vector in place of the Charon 16A DNA used 
above. All conditions are identical to those described 

10 for Charon 16A DNA. Selection of recombinant phages is 
performed by (a) plating the phages on Lac + bacteria on 
plates containing X6 and isolating colorless plaques 
followed by either hybridization to a suitable probe or 
restriction endonuclease digestion as described by 
15 Blattner, et al. # supra . The inserts are removed as 

described above, yielding DNA 1 s of about 700 base pairs 
in length and having the sequences shown in Figures 1 
and 2. 

(C) J\.gt WES^AB DNA, prepared as described 
by Tiemier et al., supra , is utilized as the transfer 
vector in place of the Charon 16A used above. All 
conditions are identical to those described for Charon 
16A. Selection of recombinant phages is performed by 
the hybridization method of Benton and Davis, supra. 
The inserts are removed as described above, yielding 
DNA's of about 700 base pairs in length and having the 
sequences shown in Figure 3. 

(D) Examples 5 (A) - (C) are repeated in which 
E. coli rpi, e. coli HB101, E. coli DP50 or E. coli 
DPSOSupF is used in place of E. coli ^ 1776. All 
conditions are identical and identical results are 
obtained. 
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Example 6 

For this example, the cDNA is prepared as 
described in Example 1. Hind III restriction site 
linkers are added and the cDNA digested with Hind III or 
5 Hsu I as described in Example 4 (A) . 

The plasmid pCl94 isolated from S. aureus as 
described by Ehrlich, S.D., supra , is utilized as the 
transfer vector. pC194 is cleaved at the Hind III site 
with Hind III or Hsu I and then treated with alkaline 

10 phosphatase as described by Ullrich, A„ et al., supra . 
The cDNA having Hind III cohesive termini is ligated to 
the cleaved pC194 as described by Ullrich, A. , et al., 
supra » Competence-induction of' B. subtil is RUB331 
is performed "as described by Sgaramella, V,, et al., 

15 J.Mol.Biol . , 105, 587 (1976) and Borenstein, S. and 
Ephrati-ElizueT~E. , J.Mol .Biol ., 45, 137 (1969). 
Transformation of B. subtilis RUB~331 is performed using 
the ligation mixture as described by Sgaramella et al. 
and Borenstein et al. The cell suspension is plated 

20 directly on L plates containing 3 <4{q of chloramphenicol. 
Selected recombinants are isolated as described in 
Example 1 and digested with Hind III and the digest 
analyzed as described in Example 2. DXA's of about 700 
base pairs in length and having the sequences shown in 

25 Figure 3 are recovered. 

Ex ajnple 7 

Synthe si s of DNA coding for Prepro SI 
(A) The insert DNA is first separated from 
pBR322/pLaSl by Pst I enconuclease digestion, and 

30 purified by preparative gel electrophoresis. A 15 /C(g 
sample of purified insert DNA is th'en modified by 
suspending the DNA in water to which is added a 
concentrated solution of salts such that the final 
composition contains 70mM tris pH 8.8, 70r,M MgCl 2 lOmM 

35 dithiothreitol and 13.75 units of T4 poly: ?rase (P-L 

Biochemicals, Milwaukee, Wisconsin) in a • ital volume of 

BAD ORIGINAL 
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250 /tl. The reaction mixture is incubated 3.3 minutes 
at 37°C. To the reaction mixture is then added dTTP / 
50 mM, to terminate endonucleolytic digestion at the next 
adenine residue in the sequence. After an additional 
5 0.5 minutes' incubation, the reaction mixture is 

transferred to an ice-bath and the enzyme is inactivated 
by heat treatment for five minutes at 65°C. The treated 
DNA is recovered by ethanol precipitation. Digestion 
with SI nuclease to provide blunt ends is carried out as 
10 described by Ullrich, A., et al.. Science , 196 , 1313 
(1977). 

The translated region of pLaSl begins with an A 
residue, as shown in Figure 3. The nearest A residue in 
the 5 '-untranslated region is five residues prior to the 

15 start codon (AUG) . The above-described digestion 
conditions are designed to remove approximately 30 
nucleotides from the 3* end. Therefore, a significant 
proportion of the T4 polymerase -digested DNA molecules 
will be terminated at the start codon at the position 

20 numbered -107. 

(B) Example 7(A) is repeated in which the 
insert for Prepro SI is removed by digestion of the 
transfer vectors prepared in Examples 3, 4, 5 and 6 by 
the appropriate restriction enzyme prior to digestion 

25 with T4 polymerase. The transfer vectors and 

restriction enzymes utilized are shown in Table 3. 

Table 3 

Restriction Enzyme Transfer Vector (Example) 

Pst I 3A, 3B, 4G, 4H 

30 Hind III 4 A, 4B, 6 

Sal I 4C, 4D 

Bam HI 4C, 4D 

Eco RI 4E, 4F, 5A, 5B, 5C 
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The remaining steps are as described in Example 7 (A) and 
identical results are obtained in each instance. 

Example 8 

Synthesis of DNA coding for Prepro S2 
5 The plasmid pBR322/pLaS2 or the other transfer 

vectors containing the Prepro S2 insert prepared in the 
preceding examples is digested with the appropriate 
restriction enzyme as indicated in Example 7(A) or 7(B) 
and purified by preparative gel electrophoresis . 

10 Removal of the 5* -untranslated region is performed using 
T4 polymerase as described in Example 7(A) except that it 
is conducted in the presence of dATP. The reaction is 
stopped as described in Example 7(A). The DNA is 
extracted with phenol and precipitated. Excess dATP and 

15 the digested bases are removed by chromatography on 6-50 
Sepadex. T4 polymerase is added to the DNA and 
incubated for 0.25 minutes before dATP is added to stop 
the digestion at the T preceding the start codon. The 
reaction is stopped and the DNA is extracted and 

20 chromatographed as described above. A final digestion 
with T4 polymerase in the presence of dTTP is conducted. 
The treated DNA is recovered by ethanol precipitation and 
digested with SI nuclease to provide blunt ends as 
previously described* This yields a DNA which begins 

25 with the start codon at the position numbered -111. 

Example 9 
Synthesis of DNA coding for SI 
The insert coding for Prepro SI is removed by 
digestion of the transfer vectors prepared in the 
30 preceding Examples with the appropriate restriction 

enzymes as indicated in Example 7(A) or 7 (B). The insert 
is then digested with Xma I and purified by preparative 
gel electrophoresis. The purified insert DNA is then 
digested with T4 polymerase in the presence of dATP as 
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described in Example 7 (A) . The reaction is stopped, the 
treated DNA recovered and digested with SI nuclease to 
provide blunt ends as described in Example 7 (A) . This 
procedure yields a DNA coding for all of SI except the 
5 two N-terminal amino acids. A nucleotide sequence coding 
for these two amino acids is chemically synthesized by 
the procedure of Itakura, K. et al., J. Biol. Chem . 250 , 
4592 (1975) and Itakura, K. et al., J/ Am. Chem. SocT ~97, 
7326 (1975) . This sequence is then blunt-end ligated to 
10 the remainder of the SI coding sequence by the procedure 
of Valenzuela, P., et al.. Nature 280 , 815 (1979). This 
yields the complete coding sequence for SI. 

Example 10 
Synthesis of DNA coding for S2 

15 The insert coding for Prepro S2 is removed by 

digestion of the transfer vectors prepared in the 
preceding Examples with the appropriate restriction 
enzymes as indicated in Example 7 (A) or 7 (B) . The insert 
is then digested with the restriction enzyme Ava II or 

20 Hgi CII or Hgi Eli. A frament of DNA of about 120 base 
pairs in length which begins at the third base in the 
codon at position -15 of Prepro S2 and continues into the 
3* untranslated region is isolated by gel electrophoresis. 
This fragment is then digested with T4 polymerase as 

25 described in Examples 7 (A) and 9 using five cycles 
following the format described in Example 8. These 
cycles are conducted in the presence of dTTP, dCTP, 
dGTP, dTTP and dCTP, respectively. After the final T4 
polymerase digestion, the DNA is recovered by ethanol 

30 precipitation and digested with SI nuclease to provide 
blunt ends as described in Example 7 (A) . This yields 
the. complete coding sequence for S2. 
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Example 11 

(A) The deoxynucleotide sequence coding for 
Preipro SI prepared in Examples 7(A) and 7 (B) , i.e., 
Prespro SI beginning at the start codon, is inserted into 
5 transfer vectors by following procedures described in 
Examples 1, 3(A), 3(B), 4 (A) -(I), 5(A), 5(B) and 6. All 
conditions are identical to those described. The DNA is 
removed with the appropriate restriction enzyme, for 
example as shown in Table 3, and analyzed as described in 
10 Example 2. A DNA having the sequence beginning at the 

stajrt codon at position -107 and continuing to the end of 
the sequence shown for pLaSl in Figure 3 is obtained in 
eac5i instance. 

(B) The deoxynucleotide sequence coding for 
15 Prepro S2 prepared in Example 8, i.e., Prepro S2 

beginning at the start codon, is inserted into transfer 
veetors, removed and analyzed as described in Example 
11 (A) . A DNA having the sequence beginning at the start 

L L 

codon at position -111 and continuing to the end of the 
20 sequence shown for pLaS2 in Figure 3 is obtained in each 
instance. 

(C) * The deoxynucleotide sequence coding for 

51 prepared in Example 9 is inserted into transfer 
vectors, removed and analyzed as described in Example 

25 11(A). A DNA having the sequence shown for Si 
(Somatostatin I) in Figure 3 is obtained in each 
instance. 

(D) The deoxynucleotide sequence coding for 

52 prepared in Example 10 is inserted into transfer 
30 vectors, removed and analyzed as described in Example 

11 (A) . A DNA having the sequence shown for S2 
(Somatostatin II) in Figuge 3 is obtained in each 
instance. 
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Example 12 

The various coding regions described herein can 
be expressed by any of the methods described above. 

(A) Expression of Prepro SI as a fusion 

5 protein is demonstrated by incorporation of a radioactive 
amino acid into a protein not present in untransformed 
cells or by radioimmunoassay. In both instances, 
bacterial cells containing pBR322/pLaSl, prepared in 
Example 1, or bacterial cells containing pBR3 22 as the 

10 control are grown in nutrient broth. In the former 

procedure, after three hours of growth, 1.5 ml of cells 
are radioactively labelled by addition of 20/X. Ci 35 S-L 
methionine and incubated for 10 minutes. The cells are 
then collected by centrifugation, washed and resuspended 

15 in 250 jaI of buffer containing glycol 10% (v/v), J&- 
mercaptpethanol 5% (v/v), and SDS 2.3% (w/v) in 
0.0625M tris pH 6.8. The suspension is boiled for five 
minutes, then applied to a 10% (w/v) SDS-polyacrylamide 
gel and fractionated by electrophoresis. The protein 

20 bands are visualized by autoradiography. The results 
show the existence of a new protein band not observed in 
non-transformed cultures. 

In the radioimmunoassay procedure, the 
bacterial cells are collected after growth by centrifuga- 

25 tion. The cells are resuspended, lysed and radiolabeled 
antibody to Si that does not depend upon a free N- 
terminal group at position 1 of SI (Arimura, A., et al., 
Proc. Soc. Exp. Biol. Med . 148, 784 (1975)) is added. 
The immune complex is precipitated and radioactivity 

30 measured. A position reaction is obtained, indicating 
the production of a fusion protein -containing Prepro SI. 

(B) For the direct expression of Prepro SI, 
the insert pLaSl is modified as described in Example 
7(A) or 7(B) or the modified insert is removed from the 

35 transfer vectors prepared in Example 11(A) . 
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A vector for direct expression is constructed 
by modification of the plasmid ptrpE 30 by the removal of 
23-29 nucleotides using T4 polymerase and SI nuclease 
as described above. 

5 Both the modified pLaSl insert and the modified 

expression vector are provided with a specific linker 
oligonucleotide having the sequence 5 1 -CCGGATCCGG-3 ' on 
one strand and its complementary sequence on the other, 
by blunt end ligation using DNA ligase as described by 
10 Valenzuela et al., Nature , 280 , 815 (1979). The linker 
oligonucleotides provide restriction sites sensitive to 
Bam HI endonuclease which are employed to facilitate 
insertion. The insertion reaction is carried out 
essentially as described by Ullrich, A., et al., supra . 

15 Host bacteria, E. coli HB101, RRI or ^1776, are 
transformed by the recombinant vectors bearing the 
inserted modified Prepro SI coding region, and 
transformants are selected for resistance to ampicillih. 
A single transformant selected for further analysis is 

20 designated p trpE 30/PPSl . 

Bacterial cells transformed by p trpB 30/PPSl 
are grown in a standard r.lnimal medium (M9) supplemented 
with leucine, proline, vitamin Bl and ampicillin, at 
37 °C. In early log phase, the trp operon is induced by 

25 addition of -^-iridolylacrylic acid (30>^g/ml of medium). 
Control cultures are left uninduced. After three more 
hours of growth, 1.5 ml of cells are radioactively 
labeled by addition of 20 M Ci 35 S-L-methionine and 
incubation for 10 Minutes.. The cells are then collected 

30 by centrifugation, washed and resuspended in 250 Ml of 
buffer containing blycol 10% (v/v) , A -meircaptoethanol 
5% (v/v), and SDS 2.3% (w/v) in 0.0625M tris pH 6.8. 
The suspension is boiled for five minutes, then applied 
to a 10% (w/v) SDS-polyacryl amide gel and fractionated 

35 by electrophoresis. The protein bands are visualized by 
autoradiography. The results show the existence of a 
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new protein band of about 13,000 molecular weight, not 
observed in uninduced or non-trans formed cultures. 

Prepro SI is purified by conventional 
techniques including, for example, gel filtration, ion 
5 exchange chromatography, affinity chromatography, and 
differential solubility techniques. Prepro SI is con- 
verted to Pro SI by following the procedure described by 
Jackson, R.C., et al., supra . 

(C) Prepro S2 is expressed either as a fusion 
0 protein or directly by following the procedure described 

in Example 12(A) or 12 (B) , respectively, using pBR322/ 
pLaS2 or the modified pLaS2 insert, respectively. 
Production of a fusion protein containing Prepro S2 or 
production of Prepro S2 directly is determined by 
5 radioactive amino acid incorporation. The procedure is 
identical to that described in Example 12(A) or 12 (B) 
except that radiolabeled tyrosine is used in place of 
the radiolabeled methionine. The results show the 
existence of a protein band not observed in non- 
transformed cultures for the production of a fusion 
protein, and the existence of a protein band of about 
14,100 molecular weight not observed in uninduced or 

non-transformed cultures for the production of Prepro 
S2. 

(D) A specific linker having the sequence 
5" - CCGGATCCGGATG - 3 f on one strand and its 
complementary sequence on the other is blunt-end ligated 
to the DNA coding for SI cr S2 as prepared in Examples 

9 and 10, respectively. This modified DNA is then 
inserted in the modified plasmid ptrpE30 as described 
in Example 12(B) . Host bacteria are transformed and 
cultured and analyzed as described in Example 12(B) or 
12(C), respectively. The results indicate the 
production of Si or S2. This is confirmed by an amino 
acid analysis. 
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While the invention has been described in 

» 

connection with specific embodiments thereof # it will be 
understood that it is capable of further modifications. 
This application is intended to cover any variations, 
uses or adaptations of the invention following, in 
general, the principles of the invention and including 
such departures from the present disclosure as come 
within known and customary practice within the art to 
which the invention pertains. 



CLAIM S 



39 



0046669 



1. A somatostatin or somatostatin precursor 
selected from the group comprising preprosomatostatin-1, 
prosomatostatin-1, preprosomatostatin-2 , prosomato- 
statin-2 and somatostatin-2 . 

2. A DNA transfer vector comprising a 
deoxynucleotide sequence coding for a somatostatin or 
somatostatin precursor selected from the group comprising 
preprosomatostatin-1, prosomatostatin-1, preprosomato- 
statin-2, prosomatostatin-2 and somatostatin-2 . 

3. The DNA transfer vector of claim 2 wherein 
the somatostatin is somatostatin-2. 

4 ; The DNA transfer vector of claim 3 wherein 
said deoxynucleotide sequence comprises a plus strand 
having the sequence: 

5' - GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC 
TTC ACT TCC TGT TAA - 3" 

wherein 

* 4 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

5. The DNA transfer vector of claim 2 wherein 
the somatostatin precursor is preprosqmatostatin-1. 

6. The DNA transfer vector of claim 5 wherein 
said deoxynucleotide sequence comprises a plus strand 
having the sequence: 

5 f - ATG AAG ATG GTC TCC TCC TCG CGC CTC CGC 

TGC CTC CTC GTG CTC CTG CTG TCC CTG ACC GCC TCC ATC AGC 

TGC TCC TTC GCC GGA CAG AGA GAC TCC AAA CTC CGC CTG CTG 

CTG CAC CGG TAC CCG CTG CAG GGC TCC AAA CAG GAC ATG ACT 

CGC TCC GCC TTG GCC GAG CTG CTC CTG TCG GAC CTC CTG CAG 

GGG GAG AAC GAG GCT CTG GAG GAG GAG AAC TTC CCT CTG GCC 

GAA GGA GGA CCC GAG GAC GCC CAC GCC GAC CTA GAG CGG GCC 

GCC AGC GGG GGG CCT CTG CTC GCC CCC CGG GAG AGA AAG GCC 

GGC TGC AAG AAC TTC TTC TGG AAA ACC TTC ACC TCC TGC TGA 
- 3' 
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wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

7. The DNA transfer vector of claim 2 wherein 
the somatostatin precursor is preprosomatostatin-2. 

8. The DNA transfer vector of claim 7 wherein 
said deoxynucleotide sequence comprises a plus strand 
having the sequence: 







5 


1 - ATG CAG ' 


TGT ATC ( 


CGT TGT CCC GCC ATC ' 


TTG 


GCT 


CTC 


CTG 


GCG 


TTG 


GTT 


CTG 


TGC 


GGC 


CCA 


AGT 


GTT 


TCC 


TCC 


CAG 


CTC 


GAC 


AGA 


GAG 


CAG 


AGC 


GAC 


AAC 


CAG 


GAC 


CTG 


GAC 


CTG 


GAG 


CTG 


CGT 


CAG 


CAC 


TGG 


CTG 


CTG 


GAG 


AGA 


GCC 


CGG 


AGC 


GCC 


GGA 


CTC 


CTG 


TCC 


CAG 


GAG 


TGG 


AGT 


AAA 


CGG 


GCG 


GTG 


GAG 


GAG 


CTG 


CTG 


GCT 


CAG 


ATG 


TCT 


CTG 


CCA 


GAG 


GCC 


ACG 


TTC 


CAG 


CGG 


GAG 


GCG 


GAG 


GAC 


GCG 


TCC 


ATG 


GGA 


ACA 


GAA 


GGA 


CGG 


ATG 


AAC 


CTA 


GAG 


CGG 


TCC GTG .GAC TCT ACC 


AAC 


AAC 


CTA 


CCC 


CCT 


CGT 


GAG 


CGT 


AAA 


GCT 


GGC 


TGT 


AAG 


AAC 


TTC 


TAT 


TGG 


AAG 


GGC 


TTC 


ACT 


TCC 


TGT 


TAA 


- 3» 





















wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

9* The DNA transfer vector of any of claims 2 
to 8 wherein said deoxynucleotide sequence comprises a 
DNA strand having a deoxynucleotide sequence complementary 
to the ribonucleotide sequence of mRNA coding for 
said somatostatin or somatostatin precursor* 

10. The transfer vector of claim 5 or 7 
comprising pBR322/pLaSl or pBR322/pLaS2 . 

11. A microorganism transformed by the transfer 
vector of any of claims 2 to 10. 

12. The microorganism of claim 11 selected 
from the group comprising Escherichia coli HB101, RRl or 

#1776. 
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13, A process for preparing a DNA transfer 
vector for use in maintaining and replicating a 
deoxynucleotide sequence coding for a somatostatin or a 
somatostatin precursor selected from the group comprising 

5 preprosomatostatin-1, prosomatostatin-1, preprosomato- 
statin-2, prosomatostatin-2 and somatostatin-2 
characterized by reacting said deoxynucleotide sequence 
with a DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme. 

14. The process of claim 13 wherein said 
deoxynucleotide sequence codes for somatostatin-2. 

.15. The process of claim 14 wherein said 
deoxynucleotide sequence coding for somatostatin-2 
comprises a plus istrand having the sequence: 

5 • - GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC 
5 TTC ACT TCC TGT TAA - 3 1 

wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
10 T is thymidyl. 

16. The process of claim 13 wherein said 
deoxynucleotide sequence codes for preprosomatostatin-1. 

17. The process of claim 16 wherein said 
deoxynucleotide sequence coding for preprosomatostatin-1 
comprises a plus strand having the sequence: 

5 1 - ATG AAG ATG GTC TCC TCC TCG CGC CTC CGC 
5 TGC CTC CTC GTG CTC CTG CTG TCC CTG ACC GCC TCC ATC AGC 
TGC TCC TTC GCC GGA CAG AGA GAC TCC AAA CTC CGC CTG CTG 
CTG CAC CGG TAC CCG CTG CAG GGC TCC AAA CAG GAC ATG ACT 
CGC TCC GCC TTG GCC GAG CTG CTC CTG TCG GAC CTC CTG CAG 
GGG GAG AAC GAG GCT CTG GAG GAG GAG AAC TTC CCT CTG GCC 
10 GAA GGAGGACCC GAG GAC CGG CAC GCC GAC CTA GAG CGG GCC 
GCC AGC GGG GGG CCT CTG CTC GCC CCC CGG GAG AGA AAG GCC 
• GGC TGC AAG * AAC TTC TTC TGG AAA ACC TTC ACC TCC TGC TGA 
- 3* 

wherein 
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A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

18. The process of claim 13 wherein said 
deoxynucleotide sequence codes for preprosomatostatin-2 . 

19. The process of claim 18 wherein said 
deoxynucleotide sequence coding for preprosomatostatin-2 
comprises a plus strand having the sequence: 

5 1 - ATG CAG TGT ATC CGT TGT CCC GCC ATC TTG 
5 GCT CTC CTG GCG TTG GTT CTG TGC GGC CCA AGT GTT TCC TCC 
CAG CTC GAC AGA GAG CAG AGC GAC AAC CAG GAC CTG GAC CTG 
GAG CTG CGT CAG CAC TGG CTG CTG GAG AGA GCC CGG AGC GCC 
GGA CTC CTG TCC CAG GAG TGG AGT AAA CGG GCG GTG GAG GAG 
CTG CTG GCT CAG ATG TCT CTG CCA GAG GCC ACG TTC CAG CGG 
10 GAG GCG GAG GAC GCG TCC ATG GCA ACA GAA GGA CGG ATG AAC 
CTA GAG CGG TCC GTG GAC TCT ACC AAC AAC CTA CCC CCT CGT 
GAG CGT AAA GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC TTC 
ACT TCC TGT TAA - 3» 

wherein 

15 A is deoxyadenyl, 

G is deoxyguanyl, 

C is deoxycytosyl and , 
T is thymidyl. 

20. A process for making the plasmid pBR322/ 
pLaSl or pBR322/pLaS2 according to claim 13 characterized 
in that the DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme is pBR322 and the 

5 restriction enzyme is Pst I. 

21. A process for making an expression 
transfer vector in accordance with claim 13 for use in 
expressing a deoxynucleotide sequence coding for a 
somatostatin or somatostatin precursor characterized in 

5 that the DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme is cleaved at a point 
within an expressible control region. 
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22. A process for making a fusion protein 
comprising the amino acid sequence of a somatostatin or 
a somatostatin precursor as its C-terminal end and a 
portion of a procaryotic protein as its N-terminal end 

5 characterized by incubating a microorganism transformed 
by an expression vector prepared in accordance with 
claim 21.. 

23. A process for synthesizing a somatostatin 
or a somatostatin precursor selected from the group 
comprising preprosomatostatin-1, preprosomatostatin-2 
and somatostatin-2 characterized by incubating a micro- 

5 organism transformed by an expression vector comprising 
a deoxynucleotide sequence coding for said somatostatin 
, or, somatostatin precursor prepared in accordance with 
any of claims 14 to 19 under conditions suitable for 
expression of said deoxynucleotide sequence and purifying 
10 said somatostatin or somatostatin precursor from the 
lystate or culture medium of said microorganism. 
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CLAIMS 

1. A process for making a DNA transfer vector 
for use in maintaining and replicating a deoxynucleotide 
sequence coding for a somatostatin or a somatostatin 
precursor selected from the group comprising prepro- 

* 

5 somatostatin-1, prosomatostatin-1 , preprosomatostatin-2 , 
prosomatbstatin-2 and somatostatin-2 characterized by 
reacting said deoxynucleotide sequence with a DNA 
molecule prepared by cleaving a transfer vector with a 
restriction enzyme* 

2. The process of claim 1 wherein said 
deoxynucleotide sequence codes for somatostatin-2 • 

3. The process of claim 2 wherein said 
deoxynucleotide sequence coding for somatostatin-2 
comprises a plus strand having the sequence: 

5* - GCT GGC TGT AAG AAC TTC TAT TGG AAG GGC 
5 TTC ACT TCC TGT TAA - 3' 

wherein 

A is deoxyadenyl, 
6 is deoxyguanyl, 
C is deoxycytosyl and 
10 T is .thymidyl. 

4. The process of claim 1 wherein said 
deoxynucleotide sequence codes for preprosomatostatin-*l % 

5. The process of claim 4 wherein said 
deoxynucleotide sequence coding for preprosomatostatin-1 
comprises a plus strand having the sequencer 

* 

5' - ATG AAG ATG GTC TCC TCC TCG CGC CTC CGC 
5 TGC CTC CTC GTG CTC CTG CTG TCC CTG ACC GCC TCC ATC AGC 
TGC TCC TTC GCC GGA CAG AGA GAC TCC AAA CTC CGC CTG CTG 
CTG CAC CGG TAC CCG CTG CAG GGC TCC AAA CAG GAC ATG ACT 
CGC TCC GCC TTG GCC GAG CTG CTC CTG TCG GAC CTC CTG CAG 
GGG GAG AAC GAG GCT CTG GAG GAG GAG AAC TTC CCT CTG GCC 
10 GAA GGA GGA CCC GAG GAC GCC CAC GCC GAC CTA GAG CGG GCC 
GCC AGC GGG GGG CCT CTG CTC GCC CCC CGG GAG AGA AAG GCC 
GGC TGC AAG AAC TTC TTC TGG AAA ACC TTC ACC TCC TGC TGA 
- 3" 



wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

6. The process of claim 1 wherein said 
deoxynucleotide sequence codes for preprosomatostatin-2. 

7. The process of claim 6 wherein said 
deoxynucleotide sequence coding for preprosomatostatin-2 
comprises a plus strand having the sequence: 

5 1 - ATG CAG TGT ATC CGT TGT CCC GCC ATC TTG 



GCT 


CTC 


CTG 


GCG 


TTG 


GTT 


CTG 


TGC 


GGC 


CCA 


AGT 


GTT 


TCC 


TCC 


CAG 


CTC 


GAC 


AGA 


GAG 


CAG 


AGC 


GAC 


AAC 


CAG 


GAC 


CTG 


GAC 


CTG 


GAG 


CTG 


CGT 


CAG 


CAC 


TGG 


CTG 


CTG 


GAG 


AGA 


GCC 


CGG 


AGC 


GCC 


GGA 


CTC 


CTG 


TCC 


CAG GAG TGG 


AGT 


AAA 


CGG 


GCG 


GTG 


GAG 


GAG 


CTG 


CTG 


GCT 


CAG 


ATG 


TCT 


CTG 


CCA 


GAG 


GCC 


ACG 


TTC 


CAG 


CGG 


GAG 


GCG 


GAG 


GAC 


GCG 


TCC 


ATG 


GGA 


ACA 


GAA 


GGA 


CGG 


ATG 


AAC 


CTA 


GAG 


CGG 


TCC 


GTG 


GAC 


TCT 


ACC 


AAC 


AAC 


CTA 


CCC CCT 


CGT 


GAG 


CGT 


AAA 


GCT 


GGC 


TGT AAG 


AAC 


TTC 


TAT 


TGG 


AAG 


GGC 


TTC 


ACT 


TCC 


TGT 


TAA 


-3' 





















wherein 

A is deoxyadenyl, 
G is deoxyguanyl, 
C is deoxycytosyl and 
T is thymidyl. 

8. A process for making the plasmid pBR322/ 
pLaSl or pBR322/pLaS2 according to claim 1 characterized 
in that the DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme is pBR322 and the 
restriction enzyme is Pst 1. 

9. A process for making an expression 
transfer vector in accordance with claim 1 for use in 
expressing a deoxynucleotide sequence coding for a 
somatostatin or somatostatin precursor characterized in 
that the DNA molecule prepared by cleaving a transfer 
vector with a restriction enzyme is cleaved at a point 
within an expressible control region . 
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10. A process for making a fusion protein 
comprising the amino acid sequence of a somatostatin or 
a somatostatin precursor as its C-terminal end and a 
portion of a procaryotic protein as its N-terminal end 
characterized by incubating a microorganism transformed 
by an expression vector prepared in accordance with 
claim 9 . 

11. A process for synthesizing a somatostatin 
or a somatostatin precursor selected from the group 
comprising preprosomatostatin-1, preprosomatostatin-2 
and somatostatin- 2 characterized by incubating a micro- 
organism transformed by an expression vector comprising a 
deoxynucleotide sequence coding for said somatostatin or 
somatostatin precursor prepared in accordance V7ith any 

of claims 2 to 7 under conditions suitable for 
expression of said deoxynucleotide sequence and purifying 
said somatostatin or somatostatin precursor from the 
lysate or culture medium of said microorganism. 



» 
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FIG. I 
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3/3 CAUC4 



CAUCCGCAGACGCCCCCACACCUACAGACAUCACGU6 
• • ••••• # * •••••» •*••«• 

CACACACAAACCCACCACAACCAtiUACAACCACCACAACACACCACACCACCACACACO: : : it 
-107 -100 

met lys net val set ser ser arg leu erg cys Jcu leu va] leu leu 
AUG AAG AUG COC UCC OCC UCC CCC CUC CCC UGC CUC CUC CUC CUC COG 

•* • * • »#» • • • Jr 

AUG CAG UCU AUC CGU UCU CCC CCC AOC OUG GCU CUC CUG CCC UUG CUU 
net gin cys ile arg cys pro ala lie leu ala leu leu Ala leu val 
-111 -110 -loo 

-90 -bo 

leu ser leu tht Ala ser lie set cys ser phe ala gly gin org asp 

CUG uXC COG ACC CCC UCC AUC ACC UGC. UCC UUC CCC CCA CAG AGA CAC 
••• • • • * * ••••*•*• 

CUG UCC CCC CCA AGU CUU UCC UCC CAG CUC GAC AGA CAG CAG ACC GAC 
leu cys gly pro ser val ser ser gin leu asp arg glu gin ser asp 

-90 . eo 

-70 

ser lys leu arg leu leu leu his arg tyr pro leu gin gly ser 
UCC AAA ::: CUCCGC CUG CUC CUG CAC CGG UAC CCG CUG CAG GGC UCC 
* * •** • •*• • *•* * « * ** * * * • 

AAC CAG GAC CUG CAC CUC CAC CUG CGU CAG CAC UGG CUG CUG GAG ACA 
asn gin asp leu asp leu glu leu arg gin his trp leu leu glu arg 

-70 

-60 

* v f 9ln asp n*t thr arg ser ala leu 

AAA :: : : : : : : : : : : ::: ; : : : : i CAG CAC AUG ACU CGC UCC CCC UUG 

♦*# •* • * * ** •* 

GCC CGG ACC CCC CCA CUC CUG UCC CAG GAG UGG AGU AAA CGG CCG GUG 
ala arg ser ala gly leu leu eer gin glu trp ser lys arg ala val 
-60 .so 

-50 _ l0 
ala glu leu leu leu ser asp leu leu gin gly glu asn glu ala leu 
CCC GAG CUG CUC CUG UCG GAC CUC CUG CAG CCG GAG AAC CAG GCU CUG 
• **• •* * *** * # * • • ft 

GAG GAG CUG CUG GCU CAG AUG UCU CUG CCA GAG GCC ACG UUC CAC CGG 
glu glu leu leu ala gin met ser leu pro glu ala thr phe gin arg 

-30 . 20 
9lu glu glu asn phe pro leu ala glu gly gly pro glu asp ala his 
GAG GAC GAG AAC UUC CCU CUC GCC GAA GGA GGA CCC GAC GAC GCC CAC 

GAG CCG GAG GAC GCG UCC AUG CCA ACA GAA GGA CGG ::: 

arg 
-20 



GAG GCG GAG GAC GCG UCC AUG CCA ACA GAA GGA CGG ::: ::: ts: s -- r* I S% -m 
9l« «la glu asp ala ser met ala thr glu gly arg " I IV J 



-10 

ala asp leu glu arg ala ala ser gly gly pro leu leu ala pro arg 
GCC GAC CUA GAG CGG GCC GCC AGCGGGGCCCCUCUCCUCCCCCCCCGG 
** *** *** *** •* • * *«••*•'«• 

AUG AAC CUA GAG CGC UCC GUG GAC UCU ACC AAC AAC CUA CCC CCU CGU 
met asn leu glu arg ser val asp ser thr asn asn leu pro pro arg 

-10 

Somatostatin I 

X 10 
glu arg lys ala gly cys lys asn phe phe trp lys thr phe thr ser 
GAG AGA AAG GCC GGC UGC AAG AAC UUC UUC UGG AAA ACC UUC ACC UCC 
•** * *• *• •** **• *•• * *« • «* 

CAG CCD AAA CCU GGC UGU AAG AAC UUC UAU UGG AAG GGC UCC ACU UCC 
9lu arg lys ala gly cys lys asn phe tyr trp lys gly phe thr ser 

i 1 10 

. Somatostatin U " ! 

cys OP 

UGC UGA CCCCCUCCTCAllCCUCGCCGCUCCurCCuCl^ 2 : : : : ACAC 

•• • * ••••• • * *•* * ***** ** • * 

UGU UAA ::AGCUCCGCCCACCCAAAGCUACACCGrc 
cys OC 
14. 



ACCCu^AAlX^AUCCCGGUUUGCAGCUCCUCC^ 

•* ••••• • • * •* • *• * *« * ## « #fl . 

ACCCuCAAAUUCACCUGAACAACUGCACCGACC AAUCAGC ACCUC : UCCGG AUGGAAUGU ACC 



UGAAACUAUUUUUAAircGmJGGUUUGAAUAAAAAUCUCUUUCAGAC Puly A 
••** •" • • * • • •••» • •• 

UCAAUAAA; i : : sUAAUACUGtlUAuCAAUUAAAGCGAUAAAUCCGUC Ibly A 
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